A toolkit for optimising parallel performance

Authors

  • Alistair N. Dunlop
  • Emilio Hernández
  • Oscar Naim
  • Anthony J. G. Hey
  • Denis A. Nicole
Abstract

Three interacting tools to assist distributed memory programmers in developing, optimising and understanding application performance have been developed. These tools perform automatic code generation from an initial workload specification, performance prediction using memory hierarchy simulation, and performance visualisation for distributed memory message passing applications. Their combination facilitates extensive performance tuning from initial workload specification through to completed message passing program codes.

Similar resources

Performance Measurement and Modeling with the Lost Cycles Toolkit

Although there are many situations in which a model of application performance is valuable, performance modeling of parallel programs is not commonplace, largely because of the difficulty of developing accurate models of real applications executing on real multiprocessors. This paper describes a toolkit for performance tuning and prediction based on lost cycles analysis. Lost cycles analysis deco...

A Linear Algebra Formulation for Optimising Replication in Data Parallel Programs

In this paper, we present an efficient technique for optimising data replication under the data parallel programming model. We propose a precise mathematical representation for data replication which allows handling replication as an explicit, separate stage in the parallel data placement problem. This representation takes the form of an invertible mapping. We argue that this property is key to...

Benchmarking and Optimising Software for Improved Multiprocessor Performance

Parallel computers have become widely available with the introduction of multicore processors; however, software will need to be optimised to make use of the additional processors. A number of tools can aid the process of optimising code for parallel execution, such as parallel debuggers, profilers and parallel libraries. This paper describes an approach to evaluating the effectiveness of a sele...

A toolkit for parallel image processing

In this paper, we present the design and implementation of a parallel image processing software library (the Parallel Image Processing Toolkit). The Toolkit not only supplies a rich set of image processing routines, it is designed principally as an extensible framework containing generalized parallel computational kernels to support image processing. Users can easily add their own image process...

The semi-automatic parallelisation of scientific application codes using a computer aided parallelisation toolkit

The shared-memory programming model is a very effective way to achieve parallelism on shared memory parallel computers. Historically, the lack of a programming standard for using directives and the rather limited performance due to scalability have affected the take-up of this programming model approach. Significant progress has been made in hardware and software technologies; as a result, the p...

Publication date: 1995